44 research outputs found

    A temporal precedence based clustering method for gene expression microarray data

    Get PDF
    Background: Time-course microarray experiments can produce useful data which can help in understanding the underlying dynamics of the system. Clustering is an important stage in microarray data analysis where the data is grouped together according to certain characteristics. The majority of clustering techniques are based on distance or visual similarity measures which may not be suitable for clustering of temporal microarray data where the sequential nature of time is important. We present a Granger causality based technique to cluster temporal microarray gene expression data, which measures the interdependence between two time-series by statistically testing if one time-series can be used for forecasting the other time-series or not. Results: A gene-association matrix is constructed by testing temporal relationships between pairs of genes using the Granger causality test. The association matrix is further analyzed using a graph-theoretic technique to detect highly connected components representing interesting biological modules. We test our approach on synthesized datasets and real biological datasets obtained for Arabidopsis thaliana. We show the effectiveness of our approach by analyzing the results using the existing biological literature. We also report interesting structural properties of the association network commonly desired in any biological system. Conclusions: Our experiments on synthesized and real microarray datasets show that our approach produces encouraging results. The method is simple in implementation and is statistically traceable at each step. The method can produce sets of functionally related genes which can be further used for reverse-engineering of gene circuits

    Impact of environmental inputs on reverse-engineering approach to network structures

    Get PDF
    Background: Uncovering complex network structures from a biological system is one of the main topic in system biology. The network structures can be inferred by the dynamical Bayesian network or Granger causality, but neither techniques have seriously taken into account the impact of environmental inputs. Results: With considerations of natural rhythmic dynamics of biological data, we propose a system biology approach to reveal the impact of environmental inputs on network structures. We first represent the environmental inputs by a harmonic oscillator and combine them with Granger causality to identify environmental inputs and then uncover the causal network structures. We also generalize it to multiple harmonic oscillators to represent various exogenous influences. This system approach is extensively tested with toy models and successfully applied to a real biological network of microarray data of the flowering genes of the model plant Arabidopsis Thaliana. The aim is to identify those genes that are directly affected by the presence of the sunlight and uncover the interactive network structures associating with flowering metabolism. Conclusion: We demonstrate that environmental inputs are crucial for correctly inferring network structures. Harmonic causal method is proved to be a powerful technique to detect environment inputs and uncover network structures, especially when the biological data exhibit periodic oscillations

    Cell specific analysis of Arabidopsis leaves using fluorescence activated cell sorting

    Get PDF
    After initiation of the leaf primordium, biomass accumulation is controlled mainly by cell proliferation and expansion in the leaves1. However, the Arabidopsis leaf is a complex organ made up of many different cell types and several structures. At the same time, the growing leaf contains cells at different stages of development, with the cells furthest from the petiole being the first to stop expanding and undergo senescence1. Different cells within the leaf are therefore dividing, elongating or differentiating; active, stressed or dead; and/or responding to stimuli in sub-sets of their cellular type at any one time. This makes genomic study of the leaf challenging: for example when analyzing expression data from whole leaves, signals from genetic networks operating in distinct cellular response zones or cell types will be confounded, resulting in an inaccurate profile being generated. To address this, several methods have been described which enable studies of cell specific gene expression. These include laser-capture microdissection (LCM)2 or GFP expressing plants used for protoplast generation and subsequent fluorescence activated cell sorting (FACS)3,4, the recently described INTACT system for nuclear precipitation5 and immunoprecipitation of polysomes6. FACS has been successfully used for a number of studies, including showing that the cell identity and distance from the root tip had a significant effect on the expression profiles of a large number of genes3,7. FACS of GFP lines have also been used to demonstrate cell-specific transcriptional regulation during root nitrogen responses and lateral root development8, salt stress9 auxin distribution in the root10 and to create a gene expression map of the Arabidopsis shoot apical meristem11. Although FACS has previously been used to sort Arabidopsis leaf derived protoplasts based on autofluorescence12,13, so far the use of FACS on Arabidopsis lines expressing GFP in the leaves has been very limited4. In the following protocol we describe a method for obtaining Arabidopsis leaf protoplasts that are compatible with FACS while minimizing the impact of the protoplast generation regime. We demonstrate the method using the KC464 Arabidopsis line, which express GFP in the adaxial epidermis14, the KC274 line, which express GFP in the vascular tissue14 and the TP382 Arabidopsis line, which express a double GFP construct linked to a nuclear localization signal in the guard cells (data not shown; Figure 2). We are currently using this method to study both cell-type specific expression during development and stress, as well as heterogeneous cell populations at various stages of senescence

    Conserved noncoding sequences highlight shared components of regulatory networks in dicotyledonous plants

    Get PDF
    Conserved noncoding sequences (CNSs) in DNA are reliable pointers to regulatory elements controlling gene expression. Using a comparative genomics approach with four dicotyledonous plant species (Arabidopsis thaliana, papaya [Carica papaya], poplar [Populus trichocarpa], and grape [Vitis vinifera]), we detected hundreds of CNSs upstream of Arabidopsis genes. Distinct positioning, length, and enrichment for transcription factor binding sites suggest these CNSs play a functional role in transcriptional regulation. The enrichment of transcription factors within the set of genes associated with CNS is consistent with the hypothesis that together they form part of a conserved transcriptional network whose function is to regulate other transcription factors and control development. We identified a set of promoters where regulatory mechanisms are likely to be shared between the model organism Arabidopsis and other dicots, providing areas of focus for further research

    A specific group of genes respond to cold dehydration stress in cut Alstroemeria flowers whereas ambient dehydration stress accelerates developmental senescence expression patterns

    Get PDF
    Petal development and senescence entails a normally irreversible process. It starts with petal expansion and pigment production, and ends with nutrient remobilization and ultimately cell death. In many species this is accompanied by petal abscission. Post-harvest stress is an important factor in limiting petal longevity in cut flowers and accelerates some of the processes of senescence such as petal wilting and abscission. However, some of the effects of moderate stress in young flowers are reversible with appropriate treatments. Transcriptomic studies have shown that distinct gene sets are expressed during petal development and senescence. Despite this, the overlap in gene expression between developmental and stress-induced senescence in petals has not been fully investigated in any species. Here a custom-made cDNA microarray from Alstroemeria petals was used to investigate the overlap in gene expression between developmental changes (bud to first sign of senescence) and typical post-harvest stress treatments. Young flowers were stressed by cold or ambient temperatures without water followed by a recovery and rehydration period. Stressed flowers were still at the bud stage after stress treatments. Microarray analysis showed that ambient dehydration stress accelerates many of the changes in gene expression patterns that would normally occur during developmental senescence. However, a higher proportion of gene expression changes in response to cold stress were specific to this stimulus and not senescence related. The expression of 21 transcription factors was characterized, showing that overlapping sets of regulatory genes are activated during developmental senescence and by different stresses

    Nonparametric Bayesian inference for perturbed and orthologous gene regulatory networks

    Get PDF
    Motivation: The generation of time series transcriptomic datasets collected under multiple experimental conditions has proven to be a powerful approach for disentangling complex biological processes, allowing for the reverse engineering of gene regulatory networks (GRNs). Most methods for reverse engineering GRNs from multiple datasets assume that each of the time series were generated from networks with identical topology. In this study, we outline a hierarchical, non-parametric Bayesian approach for reverse engineering GRNs using multiple time series that can be applied in a number of novel situations including: (i) where different, but overlapping sets of transcription factors are expected to bind in the different experimental conditions; that is, where switching events could potentially arise under the different treatments and (ii) for inference in evolutionary related species in which orthologous GRNs exist. More generally, the method can be used to identify context-specific regulation by leveraging time series gene expression data alongside methods that can identify putative lists of transcription factors or transcription factor targets. Results: The hierarchical inference outperforms related (but non-hierarchical) approaches when the networks used to generate the data were identical, and performs comparably even when the networks used to generate data were independent. The method was subsequently used alongside yeast one hybrid and microarray time series data to infer potential transcriptional switches in Arabidopsis thaliana response to stress. The results confirm previous biological studies and allow for additional insights into gene regulation under various abiotic stresses. Availability: The methods outlined in this article have been implemented in Matlab and are available on request

    Perturbation of cytokinin and ethylene-signalling pathways explain the strong rooting phenotype exhibited by Arabidopsis expressing the Schizosaccharomyces pombe mitotic inducer, cdc25

    Get PDF
    Background Entry into mitosis is regulated by cyclin dependent kinases that in turn are phosphoregulated. In most eukaryotes, phosphoregulation is through WEE1 kinase and CDC25 phosphatase. In higher plants a homologous CDC25 gene is unconfirmed and hence the mitotic inducer Schizosaccharomyces pombe (Sp) cdc25 has been used as a tool in transgenic plants to probe cell cycle function. Expression of Spcdc25 in tobacco BY-2 cells accelerates entry into mitosis and depletes cytokinins; in whole plants it stimulates lateral root production. Here we show, for the first time, that alterations to cytokinin and ethylene signaling explain the rooting phenotype elicited by Spcdc25 expression in Arabidopsis. Results Expressing Spcdc25 in Arabidopsis results in increased formation of lateral and adventitious roots, a reduction of primary root width and more isodiametric cells in the root apical meristem (RAM) compared with wild type. Furthermore it stimulates root morphogenesis from hypocotyls when cultured on two way grids of increasing auxin and cytokinin concentrations. Microarray analysis of seedling roots expressing Spcdc25 reveals that expression of 167 genes is changed by > 2-fold. As well as genes related to stress responses and defence, these include 19 genes related to transcriptional regulation and signaling. Amongst these was the up-regulation of genes associated with ethylene synthesis and signaling. Seedlings expressing Spcdc25 produced 2-fold more ethylene than WT and exhibited a significant reduction in hypocotyl length both in darkness or when exposed to 10 ppm ethylene. Furthermore in Spcdc25 expressing plants, the cytokinin receptor AHK3 was down-regulated, and endogenous levels of iPA were reduced whereas endogeous IAA concentrations in the roots increased. Conclusions We suggest that the reduction in root width and change to a more isodiametric cell phenotype in the RAM in Spcdc25 expressing plants is a response to ethylene over-production. The increased rooting phenotype in Spcdc25 expressing plants is due to an increase in the ratio of endogenous auxin to cytokinin that is known to stimulate an increased rate of lateral root production. Overall, our data reveal important cross talk between cell division and plant growth regulators leading to developmental changes

    High-resolution temporal profiling of transcripts during Arabidopsis leaf senescence reveals a distinct chronology of processes and regulation

    Get PDF
    Leaf senescence is an essential developmental process that impacts dramatically on crop yields and involves altered regulation of thousands of genes and many metabolic and signaling pathways, resulting in major changes in the leaf. The regulation of senescence is complex, and although senescence regulatory genes have been characterized, there is little information on how these function in the global control of the process. We used microarray analysis to obtain a highresolution time-course profile of gene expression during development of a single leaf over a 3-week period to senescence. A complex experimental design approach and a combination of methods were used to extract high-quality replicated data and to identify differentially expressed genes. The multiple time points enable the use of highly informative clustering to reveal distinct time points at which signaling and metabolic pathways change. Analysis of motif enrichment, as well as comparison of transcription factor (TF) families showing altered expression over the time course, identify clear groups of TFs active at different stages of leaf development and senescence. These data enable connection of metabolic processes, signaling pathways, and specific TF activity, which will underpin the development of network models to elucidate the process of senescence

    Listen to genes : dealing with microarray data in the frequency domain

    Get PDF
    Background: We present a novel and systematic approach to analyze temporal microarray data. The approach includes normalization, clustering and network analysis of genes. Methodology: Genes are normalized using an error model based uniform normalization method aimed at identifying and estimating the sources of variations. The model minimizes the correlation among error terms across replicates. The normalized gene expressions are then clustered in terms of their power spectrum density. The method of complex Granger causality is introduced to reveal interactions between sets of genes. Complex Granger causality along with partial Granger causality is applied in both time and frequency domains to selected as well as all the genes to reveal the interesting networks of interactions. The approach is successfully applied to Arabidopsis leaf microarray data generated from 31,000 genes observed over 22 time points over 22 days. Three circuits: a circadian gene circuit, an ethylene circuit and a new global circuit showing a hierarchical structure to determine the initiators of leaf senescence are analyzed in detail. Conclusions: We use a totally data-driven approach to form biological hypothesis. Clustering using the power-spectrum analysis helps us identify genes of potential interest. Their dynamics can be captured accurately in the time and frequency domain using the methods of complex and partial Granger causality. With the rise in availability of temporal microarray data, such methods can be useful tools in uncovering the hidden biological interactions. We show our method in a step by step manner with help of toy models as well as a real biological dataset. We also analyse three distinct gene circuits of potential interest to Arabidopsis researchers

    Transcriptional dynamics driving MAMP-triggered immunity and pathogen effector-mediated immunosuppression in Arabidopsis leaves following infection with Pseudomonas syringae pv tomato DC3000

    Get PDF
    Transcriptional reprogramming is integral to effective plant defense. Pathogen effectors act transcriptionally and posttranscriptionally to suppress defense responses. A major challenge to understanding disease and defense responses is discriminating between transcriptional reprogramming associated with microbial-associated molecular pattern (MAMP)-triggered immunity (MTI) and that orchestrated by effectors. A high-resolution time course of genome-wide expression changes following challenge with Pseudomonas syringae pv tomato DC3000 and the nonpathogenic mutant strain DC3000hrpA- allowed us to establish causal links between the activities of pathogen effectors and suppression of MTI and infer with high confidence a range of processes specifically targeted by effectors. Analysis of this information-rich data set with a range of computational tools provided insights into the earliest transcriptional events triggered by effector delivery, regulatory mechanisms recruited, and biological processes targeted. We show that the majority of genes contributing to disease or defense are induced within 6 h postinfection, significantly before pathogen multiplication. Suppression of chloroplast-associated genes is a rapid MAMP-triggered defense response, and suppression of genes involved in chromatin assembly and induction of ubiquitin-related genes coincide with pathogen-induced abscisic acid accumulation. Specific combinations of promoter motifs are engaged in fine-tuning the MTI response and active transcriptional suppression at specific promoter configurations by P. syringae
    corecore